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Abstract 

Consider a setting where possibly sensitive information sent over a path in a 
network is visible to every neighbor of the path, i.e., every neighbor of some node 
on the path, thus including the nodes on the path itself. The exposure of a path P 
can be measured as the number of nodes adjacent to it, denoted by N[P]. A path 
is said to be secluded if its exposure is small. A similar measure can be applied to 
other connected subgraphs, such as Steiner trees connecting a given set of terminals. 
Such subgraphs may be relevant due to considerations of privacy, security or revenue 
maximization. This paper considers problems related to minimum exposure con- 
nectivity structures such as paths and Steiner trees. It is shown that on unweighted 
undirected n-node graphs, the problem of finding the minimum exposure path con- 
necting a given pair of vertices is strongly inapproximable, i.e., hard to approximate 
within a factor of 0(2^°^^ for any e > (under an appropriate complexity as- 
sumption), but is approximable with ratio \/A + 3, where A is the maximum degree 
in the graph. One of our main results concerns the class of bounded-degree graphs, 
which is shown to exhibit the following interesting dichotomy. On the one hand, the 
minimum exposure path problem is NP-hard on node-weighted or directed bounded- 
degree graphs (even when the maximum degree is 4). On the other hand, we present 
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a polynomial algorithm (based on a nontrivial dynamic program) for the problem 
on unweighted undirected bounded-degree graphs. Likewise, the problem is shown 
to be polynomial also for the class of (weighted or unweighted) bounded-treewidth 
graphs. Turning to the more general problem of finding a minimum exposure Steiner 
tree connecting a given set of k terminals, the picture becomes more involved. In 
undirected unweighted graphs with unbounded degree, we present an approxima- 
tion algorithm with ratio min{A, n/fc, \/2n, 0(log A: • {k + \/A))}. On unweighted 
undirected bounded-degree graphs, the problem is still polynomial when the number 
of terminals is fixed, but if the number of terminals is arbitrary, then the problem 
becomes NP-hard again. 



1 Introduction 



The problem. Consider a setting where possibly sensitive information sent over a path 
in a network is visible to every neighbor of the path, i.e., every neighbor of some node on 
the path, thus including the nodes on the path itself. The exposure of a path P can be 
measured as the size (possibly node-weighted) of its neighborhood in this sense, denoted 
by N[P]. A path is said to be secluded if its exposure is small. A similar measure can 
be applied to other connected subgraphs, such as Steiner trees connecting a given set 
of terminals. Our interest is in finding connectivity structures with exposure as low as 
possible. This may be motivated by the fact that in real-life applications, a connectivity 
structure operates normally as part of the entire network G (and is not "extracted" from 
it), and so controlling the effect of its operation on the other nodes in the network may be 
of interest, in situations in which any "activation" of a node (by taking it as part of the 
structure) leads to an activation of its neighbors as well. In such settings, to minimize the 
set of total active nodes, we aim toward finding secluded or sufficiently private connectivity 
structures. Such subgraphs may be important in contexts where privacy is an important 
concern, or in settings where security measures must be installed on any node from which 
the information is visible, making it desirable to minimize their number. Another context 
where minimizing exposure may be desirable is when the information transferred among 
the participants has commercial value and overexposure to "free viewers" implies revenue 
loss. 

This paper considers the problem of minimizing the exposure of subgraphs that satisfy 
some desired connectivity requirements. Two fundamental connectivity problems are con- 
sidered, namely, single-path connectivity and Steiner trees, formulated as the Secluded 
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Path and Secluded Steiner Tree problems, respectively, as follows. Given a graph 
G = (y, E) and an 5, t pair (respectively, a terminal set 5), it is required to find an 5 — t 
path (respectively, a Steiner tree) of minimum exposure. 

Related Work. The problems considered in this paper are variations of the classical 
shortest path and Steiner tree problems. In the standard versions of these problems, 
a cost measure is associated with edges or vertices, e.g., representing length or weight 
and the task is to identify a minimum cost subgraph satisfying the relevant connectivity 
requirement. Essentially, the cost of the solution subgraph is a linear sum of the solution's 
constituent parts^ i.e., the sum of the weights of the edges or vertices chosen. 

In contrast, in the setting of labeled connectivity problems, edges (and occasionally 
vertices) are associated with labels (or colors) and the objective is to identify a subgraph 

C G that satisfies the connectivity requirements while minimizing the number of used 
labels. In other words, costs are now assigned to labels rather than to single edges. Such 
labeling schemes incorporate grouping constraints, based on partitioning the set of avail- 
able edges into classes, each of which can be purchased in its entirety or not at all. These 
grouping constraints are motivated by applications from telecommunication networks, 
electrical networks, and multi-modal transportation networks. Labeled connectivity prob- 
lems have been studies extensively from complexity-theoretic and algorithmic points of 
view [Zl^lEl[9j. The optimization problems in this category include, among others, the 
Minimum Labeled Path problem [m[23]. the Minimum Labeled Spanning Tree problem 
[T5l|TT], the Minimum Labeled Cut problem [24j, and the Labelled Prefect Matching 
problem [20j. 

In both the traditional setting and the labeled connectivity setting, only edges or nodes 
that are explicitly part of the selected output structure are "paid for" in solution cost. 
That is, the cost of a candidate structure is a pure function of its components, ignoring 
the possible effects of "passive" participants, such as nodes that are "very close" to the 
structure in the input graph G. In contrast, in the setting considered in this paper, the 
cost of a connectivity structure G^ is a function not only of its components but also of 
their immediate surroundings, namely, the manner in which G^ is embedded in G plays a 
role as well. (Alternatively, we can say that the cost is a not necessarily a linear function 
of its components.) 

To the best of our knowledge, secluded connectivity problems have not been consid- 
ered before in the literature. The Secluded Path and Secluded Steiner Tree problems 
are related to several existing combinatorial optimization problems. These include the 



3 



Red-Blue Set Cover problem [HET], the Minimum Labeled Path problem [HI [23] and 
the Steiner Tree [13j and Node Weighted Steiner Tree problems [14j. A prototypi- 
cal example is the Red-Blue Set Cover problem, in which we are given a set R of red 
elements, a set B of blue elements and a family S C 2l^l^l^l of subsets of blue and red 
elements, and the objective is to find a subfamily CCS covering all blue elements that 
minimizes the number of red elements covered. This problem is known to be strongly 
inappr oximable . 

Finally, turning to geometric settings, similarly motivated problems have been studied 
in the networking and sensor networks communities, where sensors are often modeled as 
unit disks. For example, the Maximal Breach Path problem [19] is defined in the context 
of traversing a region of the plane that contains sensor nodes at predetermined points, and 
its objective is to maximize the minimum distance between the points on the path and 
the the sensor nodes. (The solution uses edges of the sensor nodes' Voronoi diagram.) A 
dual problem studied extensively is barrier coverage^ i.e., the (deterministic or stochastic) 
placement of sensors in order to make it difficult for an adversary to cross the region 
unseen (see [16j and the references therein). [5j studies fc-barrier coverage, in which the 
task is to position the sensors so that any path across the region will intersect with at least 
k sensor disks. Similarly motivated problems have been studied in the context of path 
planning in AI. "Stealth" path planning problems, in which the task is to find a minimum 
"visibility" path from source to destination, have been considered in [121 [TTl [I8] . Although 
the motivation is similar, such problems are technically quite different from the graph- 
based problems studied here; those problems are typically posed in the geometric plane, 
amid obstacles that cause occlusion, and visibility is defined in terms of line-of-sight. 

Contributions. In this paper, we introduce the concept of secluded connectivity and 
study some of its complexity and algorithmic aspects. We first show that the Secluded 
Path (and hence also Secluded Steiner Tree) problem is strongly inapproximable on 
unweighted undirected graphs with unbounded degree (more specifically, is hard to ap- 
proximate with ratio 0(2^^^^ ^^), where n is the number of nodes in the graph G, assuming 
AfV 2 VTXMS (nP^iyiogn^^^ Conversely, we devise a + 3 approximation algorithm 
for the Secluded Path problem and a min{A, n//c, v^2n, 0(log k - {k + v^))} approxima- 
tion algorithm for the Secluded Steiner Tree problem, where A is the maximum degree 
in the graph and k is the number of terminals. 

One of our key results concerns bounded-degree graphs and reveals an interesting 
dichotomy. On the one hand, we show that Secluded Path is NP-hard on the class of 
node-weighted or directed bounded-degree graphs, even if the maximum degree is 4. In 
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contrast, we show that on the class of unweighted undirected bounded-degree graphs, the 
Secluded Path problem admits an exact polynomial-time algorithm, which is based on 
a complex dynamic programming and requires some nontrivial analysis. Likewise, the 
Secluded Steiner Tree problem with fixed size terminal set is in P as well. 

Finally, we consider some specific graph classes. We show that the Secluded Path and 
Secluded Steiner Tree problems are polynomial for bounded-treewidth graphs. We also 
show that the Secluded Path (resp.. Secluded Steiner Tree) problem can be approx- 
imated with ratio 0(1) (resp., 6(logA;)) in polynomial time for hereditary graph classes 
of bounded density. As an example, the Secluded Path problem has a 6 approximation 
on planar graphs. (A more careful direct analysis of the planar case yields ratio 3.) 

Preliminaries. Consider a node- weighted graph G{V^ E^W)^ for some weight function 
W : V ^ ]R>o, with n nodes and maximum degree A. For a node u G let N{u) = 
{v G V \ (u^v) G E} be the set of u's neighbors and let N[u] = N{u) U {u} be 
dosed neighborhood^ i.e., including u itself. A path is a sequence P = [2/1, . . . , oriented 
from left to right, also termed di Ui — U£ path. Let P[i] = Ui for i G {1, . . . ,^}. Let 
First{P) = Ui and Last{P) = u^. For a path P and nodes x^y on it, let P[x^y\ be the 
subpath of P from x to y. For a connected subgraph C G and for Ui, Uj G V{G'), let 
distc^ (ui^Uj) be the distance between Ui and uj in G\ Let N{G') = Uwgg^ N{u) \ G' be 
the nodes that are strictly neighbors of G' nodes and N[G'] = {J^eG' be the set of 
nodes in the 1-neighborhood of G'. Define the cost of G' as 



Note that if G is unweighted, then the cost of a subgraph G^ is simply the cardinality of 
the set of G^ nodes and their neighbors, Cost(G') = |A^[G']|. 

We sometimes consider the neighbors of node u G V{G) in different subgraphs. To 
avoid confusion, we denote Ng'{u) the neighbors of u restricted to graph G' . 

For a subgraph G' C G, let DegCost(G') denote the sum of the degrees of the nodes of 
G' . If G' is a path, then this key parameter is closely related to our problem. It is not hard 
to see that for any given path P, Cost(P) < DegCost(P). The problem of finding an 5 — t 
path P with minimum DegCost(P) is polynomial, making it a convenient starting-point 
for various heuristics for the problem. 

In this paper we consider two main connectivity problems. In the Secluded Path 
problem we are given an unweighted graph GiV^E)^ a source node s and target node 
t, and the objective is to find an 5 — t path P with minimum neighborhood size. A 




W{u) . 



(1) 



«eAf[G'] 
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generalization of this problem is the Secluded Steiner Tree problem, in which instead 
of two terminals s and t we are given a set of k terminal nodes S and it is required 
to find a tree T in G covering 5, of minimum neighborhood size. If the given graph 
G is weighted, then the weighted Secluded Path and Secluded Steiner Tree problems 
require minimizing the neighborhood cost as given in Eq. Q. We now define these 
tasks formally. For an 5 — t pair, let Vg^t = {P | is a s-t path} be the set of all 5 — t 
paths and let g*^ = min{Cost(P) | P G Vg^t} be the minimum cost among these paths. 
Then the objective of the Secluded Path problem is to find a P* G Vs,t that attains this 
minimum, i.e., such that Cost(P*) = g*^. For the Secluded Steiner Tree problem, let 
T{S) = {T C G \ S C V{T)^T is a tree} be the set of all trees in G covering 5, and let 
be the minimum cost among these trees, i.e., 

q'{S) = min{Cost(r) | T G T{S)} . (2) 

Then the solution for the problem is a tree T* G T{S) such that Cost(r*) = 

Notation For a graph G = {V^ E) and a set S oi k terminals, let T be some Steiner tree 
(spanning all terminals). Consider a decomposition of the tree into 0{k) subpaths S(r) 
defined as follows. Let B'^iT) = Su{u G V{G) \ \Nt{u)\ > 3} be the set of terminals and 
nodes with more than three neighbors in T (thus branching points in T). For a, 6 G B''{T)^ 
let a = [a, 6] be some maximal path in T such that V{a) D B''{T) = {a, 6}, i.e., a is a 
maximal subpath in T whose internal section is free of B*{T) nodes. Since any internal 
node in a is of degree at most 2, a is indeed a path in T. Let S(r) = {a = [a, b] | a, 6 G 
S*(T)} be the collection of these maximal subpaths. Note that |S(r)| < 2k. 



2 Unweighted Undirected Graphs with Unbounded 
Degree 

Hardness of approximation. 

Theorem 2.1 On unweighted undirected graphs with unbounded degree^ the Secluded 
Path problem (and hence also the Secluded Steiner Tree problem) is strongly inap- 
proximable. Specifically, unless MV C VTIA4S{n^^^y^^^^^'^), the Secluded Path problem 
cannot be approximated to within a factor 0(2^^^^ for any e > 0. 

We show this for the simple case where S = {s^t}. The theorem is shown by a gap- 
preserving reduction from the Red-Blue Set Cover (RBSC) problem [2T]. An instance 
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of Red-Blue Set Cover consists of a set 5 = {61, ... , b\B\} of blue elements, a set i? = 
{^1, • • • 5 of red elements, and a collection S = {S'l, . . . , S^} of subsets oi B U R. 
The task is to choose a family of sets covering all blue elements but a minimum number 
of red elements. In [4J it is shown that the Red-Blue Set Cover problem cannot be 
approximated to within an 0(2^^^'"'^) ratio unless XV C PrXA^^(n^^^2/iog(n)^^ ^^^^.^ 
n = max{|i?|, \S\}. Given a Red-Blue Set Cover instance, we construct a Secluded 
Path instance (G^s^t) as follows. For each 6^, let Si = {Sj G | 6^ G Sj} be the sets 
containing 6^. For every i,^, add to G a node Vij corresponding to Sj G Si. Define 
= ^SjeSii^hj} ^ ^ [Ji^i- ^dd edges so that the sets Vi and V^+i form a complete 
bipartite graph for every 1 < i < \B\. In addition, we connect a node s to all nodes 
in Vi, and connect a node t to all nodes in V\b\- So far, our construction contains X = 
Ei + 2 < + 2 nodes (see Fig. Q. 

We now turn to describing the representation of the red elements R. For every G i?, 
let Ci be a "supernode" consisting of nodes (with no edges among them), and let 
C = 1J1=i be the set of all supernodes. All nodes of are connected to node Vij if 
n ^ Sj, for every ^ G {1, . . . , \R\}, i G {1, . . . , and j G {1, . . . , \S\}. Finally, let H 
be a "hypernode" consisting of nodes (with no edges among them), all of which are 
connected to all the constituent nodes of all supernodes. This completes the construction 
of G. Overall, \V{G)\ = 0{n^). 

By "visiting" a supernode or the hypernode, we mean visiting any of their constituent 
nodes. We now make an immediate observation which allows us to restrict ourselves to 
s — t paths P not visiting any supernode or the hypernode, i.e., with V{P) C V. 

Claim 2.1 No uj{l/n)- approximate secluded path solution will visit any supernodes or the 
hypernode, i.e., will satisfy V{P) nC = 0. 

Proof: For every path P that goes through a node G CU'H it holds that Cost(P) > n^. 
On the other hand, a path P' that visits only V nodes, i.e., V{P') C V costs only 
Cost(PO <X + \R\'n^ = 0{n^). The claim follows. | 

Let I{P) = {i \ Ci ^ N[P]] be the indices of the supernodes in the neighborhood of 
path P. Then 

Cost(P) = X + |7(P)|.n^ 
We first show the correctness of the reduction and then consider gap-preservation. 

Claim 2.2 There exists an s — t path of cost X + kn^ iff there exists an Red-Blue Set 
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Cover solution of cost k. 



Proof: (^) By Claim 2.1, we restrict ourselves to paths not visiting any supernodes. 
Thus, by the structure of the graph, any such s — t path visits one of the nodes Vij for 
each I < i < \B\^ corresponding to the selection of Sj to cover bi. Since all X ordinary 
nodes will be in N{P)^ kn? of the cost is due to the size-n^ supernodes, which correspond 
to k red elements covered in the Red-Blue Set Cover solution. 

(<^) Conversely, given a cost-/c Red-Blue Set Cover solution, we construct the path 
as follows. Starting from 5, for every 1 < i <\B\^ add to P a node Vi^j such that Sj is in 
the Red-Blue Set Cover solution, and then finally add t. The cost due to the ordinary 
nodes is X and due to the k supernodes is kn? . | 



We now show that the reduction is gap-preserving. By Claim 2/2_ it follows that 
g*^ = X + • n^, where is the optimal Red-Blue Set Cover cost (the number of 
red elements W covered by the optimal solution). Assume that there exists an a > 0{l/n) 
approximation algorithm for the Secluded Steiner Tree problem. This would result in 
path a P such that 

Cost(P) = X + |7(P)| . < a{X + • n^) < 2a • • . 

Thus |/(i^)| < 2Q{|i?*|, implying that (Cost(P)— X)/n^ is a 2a approximation to Red-Blue 
Set Cover. Since = n^, and as Red-Blue Set Cover is inapproximable within a 

factor of 0(2^^g ^'^) for every fixed e, we get that Secluded Path is inapproximable within 
a factor of 0(2^^^ ^ ^), where = /(e) for some appropriate function /. This complete 
the proof of Thm. 



2.1 



I 



Corollary 2.2 The Secluded Path problem (and hence also the Secluded Steiner Tree 

problem) is strongly inapproximable in directed acyclic graphs. 



Proof: The proof is by gap-preserving reduction from Red-Blue Set Cover, very similar 
to the general case. The key modifications from the general case are that the hypernode 
of nodes is not included and we add directions to the edges; directing the edges from 
s toward the nodes of aSi, namely, Vi, from the nodes of Vi to the nodes of V^+i for 
every i G and from the nodes of V\b\ to the node t. We also direct all the 

supernodes edges towards the supernode elements. It can be verified that this directed 
graph is acyclic. The rest of the analysis is as in the undirected case. The graph G 
contains O(n^) nodes, thus the Secluded Path problem cannot be approximated within 
an 0(2^^^^ ^^) ratio. The corollary follows. | 
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Figure 1: Example of the gap- preserving reduction from Red-Blue Set Cover to Secluded 
Path. The Red-Blue Set Cover instance is given by Si = {&i, &2, ^i, '^3}, 5^2 = 
{b4,r4},Ss = {fea, ^1,^5}, 5^4 = {fei, ^3, ^4, ^2, r4}, S's = ^2, ^5, ^3, ^s}- The optimal 
solution for Red-Blue Set Cover is {S'l, 6^3, S's}, which covers 3 red elements. The cor- 
responding s — t path is of cost 14 + 3-5^ = 389. Note that an alternative solution of 
{Si, S4} is of higher cost, covering 4 red elements, and the corresponding s — t path is of cost 
14 + 4-5^ = 514. 

2.1 Approximation 

Theorem 2.3 The Secluded Path problem in unweighted undirected graphs can be ap- 
proximated within a ratio of ^/A + 3. 

Proof: Given an instance of the Secluded Path problem, let P* be an 5 — t path that 
minimizes Cost(P*). Note that we may assume without loss of generality that for every 
node 2/ in P*, the only neighbors oi u in G from among the nodes of V{P*) are the nodes 
adjacent to 2/ in P*. To see this, note that otherwise, if u had an edges to some neighbor 
G such that e is not on P*, we could have shortened the path P* (by replacing 

the subpath from u to with the edge e) and obtained a shorter path with at most the 
same cost as P*. 

Recall that DegCost(P) denotes the sum of the node degrees of the path P, and that 
Cost(P) < DegCost(P) for any P. Recall also that the problem of finding an 5 — t path P 
with minimum DegCost(P) is polynomial. We claim that the algorithm that returns the 
path (5* minimizing DegCost((5*) yields a (a/A + 3) approximation ratio for the Secluded 
Path problem. 
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In order to prove this, we show there exists a path Q such that DegCost((5) < (\/A + 
3) • Cost(P*). This imphes that DegCost(Q*) < DegCost(Q) < (VA + 3) • Cost(P*), as 
required. 

The path Q is constructed by the following iterative process. Initially, all nodes are 
unmarked and we set Q — P"" . While there exists a node with more than \/A + 3 
unmarked neighbors on the path pick such a node x. Let y be the first (closest to 
s) neighbor of x in Q and let z be the last (closest to t) neighbor of x in Replace 
the subpath 2;] in Q with the path [y^x^z] and mark the node x. We now show that 
DegCost((5) < (\/A + 3) •Cost(P*). Let X — {xi, . . . x^} be the set of marked nodes in 
where Xi is the node marked in iteration i. Note that I — \X\ is the number of iterations 
in the entire process. 

For the sake of analysis, partition A^[P*] into two sets: S'l, those that have no unmarked 
neighbor in and 82^ those that do. We claim that \Si\ > £ • a/A. 

For each Xi G X, let P{xi) be the path that was replaced by the process of constructing 
Q in iteration and let Q{xi) be the path Q in the beginning of that iteration. Since Xi 
has more than a/A + 3 unmarked neighbors in Q{xi)^ we get that \P{xi) H P*| > \/A + 4. 
Let P~{xi) be the path obtained by removing the first two nodes and last two nodes 
from P{xi). Note that \P~{xi) n P*| > VA. In addition, the sets P~{xi) n P* for 
i = {1, . . . , ^} are pairwise disjoint, and moreover, none of the nodes in Q has a neighbor in 
p-{xi)nP\ We thus get that P-(x^)nP* C Si. Note that Cost (P*) = |5i| + |52| and that 
I^iI/VA. WegetthatDegCost(g) < |52K\/A+3)+^-A < |52KVA+3) + |5i|- VA < 
(1^21 + l^il) • (\/A + 3) = Cost(P*) . (\/A + 3). I 

In addition, for the Secluded Steiner Tree problem with k terminals, we establish 
the following. 

Theorem 2.4 The Secluded Steiner Tree problem in unweighted undirected graphs can 
he approximated within a ratio o/min{A, n/fc, v^2n, 0(logfc • {k + a/A))}. 

Recall that DegCost(r) is the sum of degrees in the tree T and let DegCost*(<S) = 
min{DegCost(r) | T C G spans S}. Clearly, Cost(T) < DegCost(r). 
The Min Degree Steiner Tree problem is defined as follows. Given a graph G — (1/, E) 
and a set of terminals 5, the objective is to find a Steiner tree T of minimal DegCost(r), 
i.e., such that DegCost(r) = DegCost*(5). 

Claim 2.3 The Min Degree Steiner Tree problem is NP-hard for arbitrary k = \S\ 
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and can be approximated within a ratio ofQ{logk). Ifk is constant^ thenKin Degree Steiner Tree 
is polynomial 

Proof: We first present an 0(log/c) approximation algorithm. In the Node Weighted 
Steiner Tree problem, nonnegative costs are assigned to nodes as well as to edges. The 
cost of subgraph G' that contains the terminals S is the sum of the costs of its nodes 
and edges. While constant factor approximations exists for the standard Steiner Tree 
problem (of edge- weighted graphs), the Node Weighted Steiner Tree problem can- 
not be approximated to within less than a logarithmic factor 0(\ogk) assuming that 
NP ^ DTIME{n^^^''^^''^''^), by a reduction from Set Cover [HlIIl]. Using the 0(logA;)- 
approximation algorithm of [8j, the Min Degree Steiner Tree problem can be solved as 
follows. Set the weight of each vertex to be its degree, that is W{v) = \N{v)\^ for every 
V G V{G)^ and set zero weights to the edges. The optimal Node Weighted Steiner Tree 
tree with these weights corresponds to the optimal Min Degree Steiner Tree tree. Since 
Node Weighted Steiner Tree is polynomial for fixed fc, so is Min Degree Steiner Tree 

LMJ. 

We now show that this approximation is tight. For brevity, let 11 denote the special 
case of the Node Weighted Steiner Tree problem in which edges have zero weights and 

denote the Min Degree Steiner Tree task. The approximation-preserving reduction 
from Set Cover presented in [14j assigns zero weights to the edges, and hence 11 is inap- 
proximable within logarithmic factor. By reducing from IT to IT' we will show that 11' is 
inapproximable within logarithmic factor as well. Given a 11 instance graph G{V^ W) 
and terminal set 5, where W{vi) > 0, we transform it into an instance of ^^ We 

assume that the weights are polynomial. To obtain G', the weights W are scaled (multi- 
plied by a common large enough factor X) to weights W such that W\vi) > \NG{vi) \ for 
every node Vi gV. Next, define the residual of vertex Vi as Res{vi) = W\vi) — \NG{vi)\- 
The scaling step guarantees that Res{vi) > 0. Next, add to each vertex Vi a disjoint 
set T{vi) = {vf^j G [1^ Res{vi)]} consisting of 1-weight Res{vi) neighbors. Formally, 
G' = (y^ W^), where r = V U.^^y T(^,) and = E U {{v,, vj), vj G T{v,),v, G V}. 
Finally, W\vi) = \NG'{vi)\ for every Vi G V and W\u) = 1 for every u G V \ V . It is 
easy to see that optimal solution of H' in G^ corresponds to an optimal solution for H in 
G. Since the weights W are polynomial (hence so is X) it follows that a Q{logk) approx- 
imation for n' in G^ would imply an Q{logk) approximation for H in G, in contradiction 
to the logarithmic inapproximability of H' due to [14] . | 

We now show the following. 
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Lemma 2.5 DegCost*(<S) = e(\/A + fc) • q%S). 



Proof: Let T* be an optimal secluded Steiner tree, meaning Cost(T*) = Starting 
with T* we will perform a set of operations on it, ending with a spanning S tree T for 
which DegCost(f) < 6(7^ + A;)Cost(r*). Since DegCost(f) > DegCost*(5) this would 
establish that DegCost*(iS) < 0(\/A + k) • We begin by describing the procedure 

for obtaining T from T* and then analyze its cost, i.e., DegCost(T). Note that as we do 
not know T*; this procedure is only theoretical to aid the analysis and is not meant for 
implementation. 

Let A* = \/A + Ak. Throughout, we assume that A > A*. Else, any 0(1)- 
approximation algorithm for the Steiner Tree problem is a 0(A)-approximation algo- 
rithm to the Secluded Steiner Tree problem. Initially, let all nodes in T* be unmarked 
and set T = T*. A node u is high-degree with respect to T if it has at least A* unmarked 
neighbors in T. While there is a high-degree node x with respect to T, do the following. 
For each subpath a — [a, h] G S(T*) such that \N{x) fl V{a)\ > 2, i.e., x has at least two 
neighbors in a, replace the following subpaths. Let yi (respectively, 1/2) be the neighbor of 
X closest to a (respectively, b) on T*. Replace the subpath [2/1,^2] ^ <^ with the subpath 
[?/i,x,?/2] and mark the node x. 

We now show that DegCost(f ) < 0{^/A + k)q%S). Let M = {ui, ...,Uy}he the set 
of marked nodes in T. This implies that the procedure consists of |M| = y iterations. Let 
To = T* and be the spanning S subgraph at the end of iteration i for i = 1, |M| 
(thus Ty = T). The node G M is the high-degree node with respect to T^_i that was 
observed at iteration i. Let M = V{T) \ M be the unmarked nodes in the final subgraph 
T. Note that T might not be a tree (due to cycle introduced by the marked nodes), but 
clearly, a subgraph of it is a tree spanning all terminals. By definition, the DegCost of T 
is given by 

DegCost (f) = DegCost (M) + DegCost (M) , (3) 

where DegCost(C/), for U C V, is given by DegCost(C/) = Y^ueu\^G{u)\' Let = 
{"^j I J ^ be the set of marked nodes at the end of iteration i. Given a path P — 
[xi, ^2, . . . , Xq-i^ Xq]^ Ict P~ = [^2, . . . , dcuotc its internal subpath. For every marked 
node Ui and a subpath a G S(T*) for which Ui has at least 3 unmarked neighbors in a 
at iteration i, let Pi{a) be the subpath in T^_i that was shortcut in by introducing Ui] 
otherwise let Pi{a) = 0. Let N~{ui) = NT^_^{ui) \ M^_i be the unmarked neighbors of 
Ui at iteration i and define "^{ui) = UaGS(T*) ^~('^^) ^ ^(^rl^))' ^he set of "internal" 
unmarked neighbors of Ui in T^_i (the nodes that are removed from T^_i due to path 
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replacements) . 

We now make two observations. (A) > a/A for every node Ui G M, and (B) 

^^(2/1),..., "i^iuy) are pairwise disjoint. We start with (A). Note that for every a G S(T*) 
on which Ui has unmarked neighbors, we ignore at most 2 nodes in ^^(t^i) (corresponding 
to the endpoints of the replaced path, Pi{a)). Since there are at most 2k subpaths in 
S(T*), there are at most 4/c unmarked neighbors of u in T^_i that are ignored in 'i^{ui). 
As Ui has A* unmarked neighbors in T^_i, we get that > A* — 4/c > a/A, which 

establishes (A). Note that (B) holds since each set ^^(t^i) C V{Ti_i) is part of the tree 
in iteration i but is not part of the tree in iteration z + 1, i.e. ^^(t^i) ^ y{Ti) (due to 
the subpath replacement step). We now refer to Eq. ^ and bound DegCost(M) and 
DegCost(M). First, observe that DegCost(M) < a/A •?*(5). This holds since every node 
Ui ^ M corresponds to a disjoint set "^{ui) C 1/(T*) of at least a/A nodes of the optimal 
tree T*. Thus, \M\ < T/(r*)/yA and \M\ < g*(5)/yA. Since the degree of each marked 
node is bounded by A, it follows that 

DegCost(M) < VA • q%S) . (4) 

We now bound the degree of the unmarked nodes M of T. Let A^* = Ng[M]. Since 
M C y(r*), it follows that A* C A[r*] and hence |A*| < g*(5). By the stopping 
criteria, it follows that any node in A* has at most A* neighbors in M. Hence 

DegCost(M) < A* .g*(5). (5) 

Combining Eqs. Q, and g, it follows that DegCost*(5) = 0{VA + k) • 

g*(5). Finally, we show that there exists a graph G and a set of terminals S for which 
DegCost*(5) = n{^/A + k) • g*(5). Set 3 < A < fc, graph G consists of 2fc + A - 5 nodes. 
Let S = {vq^ ...^Vk-i}. The skeleton for graph G is a path P = [vq^Ui^ . . . ^Uk-2^Vk-i]- 
Then, for every i G [1, — 2], connect Ui to the terminal Vi. In addition, connect each of 
the Ui nodes to the set of nodes Z = {zi, . . . , ^a-s}- The graph G is presented in Fig. 
[2| Clearly, the optimal tree T* under both measures, i.e., Cost(T*) and DegCost(T*), is 
given by the subgraph induced by G{V \ Z). Since every Z node is counted k — 2 times in 
DegCost(r*) instead of one time in Cost(r*), it follows that DegCost(r*) = A-{k-2) + k 
and Cost(r*) = 2k+A-5. Since > A, we get that DegCost(r*) = f](yA+fc)Cost(r*). 
The lemma follows. | 

Claim 2.4 The Secluded Steiner Tree problem on unweighted undirected graphs can 
be approximated within v^2n. 
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Proof: Let g he di guess for the solution of the optimal secluded Steiner tree. A 

Steiner tree Tg is constructed for each for each guess g G {1, . . . , n} as follows. Remove 
from G all vertices with degree > g and let Tg be a 2-approximation for the optimal 
Steiner tree (with minimum number of edges). Let = Tg* for 5^* = i.e., the 

2-approximation Steiner tree (where the objective is to minimize the number of edges) 
computed when considering the correct guess. We show that Cost(T^) < \/2n-g*(5'). Let 
L be the cost of the optimal Steiner tree (i.e., number of edges in the tree). Then clearly 
g*(5) > L. In addition, Cost(T') < 2L • g*(5). This follows, since our approximated 
Steiner tree contains at most twice the edges of the optimal one, and the degree of each 
vertex is bounded by (S) . Finally, since the cost of any secluded Steiner tree is bounded 
by the number of nodes n, we get that Cost{T') / q^'i^S) = min{n/g*(5), 2 • q^{S)} < v^2n. 
I 



We are now ready to complete the proof for Thm. 2.3 



Proof: First note that any 0(l)-approximation algorithm to the Steiner Tree problem is 
an 0(A)-approximation algorithm to Secluded Steiner Tree. Next, note that obtaining 
an n/k approximation is trivial, since Cost(T) < n for any spanning tree T and q*{S) > k. 



The approximation of v2n is thanks to Claim 2.4 and that of B(log/c(v A + k)) is due 



to Claims 2.5 and 2.3, Specifically, let A be an B(log/c) approximation algorithm for the 
Min Degree Steiner Tree problem, and let T be the resulting spanning S tree output 
by algorithm A given graph G and S as input. Due to Claim |2.3[ such an A exists, 
and hence DegCost(T) < B(log /c)DegCost*(5). Combining this with 2.5, we get that 
DegCost(r) < B(log/c(yA + k)) • g*(5), as required. The theorem follows. | 



3 Bounded-Degree Graphs 

In this section we show that the Secluded Path problem (and thus also the Secluded 
Steiner Tree problem) is A^P-hard on both directed graphs and weighted graphs, even 
if the maximum node degree is 4. We have the following. 

Lemma 3.1 The Secluded Path problem is NP-complete even for graphs of maximum 
degree 4 if they are either (a) node-weighted, or (h) directed. 

We start with (a). We present a reduction from Vertex Cover of the weighted Secluded 
Steiner Tree problem. Throughout we consider the special case of the weighted Secluded 
Steiner Tree problem, where S = namely, the Secluded Path problem. In [10] 
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Figure 2: Illustration of the gap between and DegCost*(5). The light purple nodes 

correspond to the set of terminals S. 



it was shown that the Vertex Cover problem is NP-complete even for planar graphs 
with maximum degree 3. Given a Vertex Cover instance in the form of a bounded- 
degree graph G{V^E)^ the instance {G\V' ^ ^W)^ s^t) to the weighted Secluded Path 
is constructed as follows. We add a "heavy" weight neighbor v to every node v G V{G). 
In addition, every edge e G is replaced by a gadget g{e) consisting of four nodes, 
^{di^)) = {U{e)^ D{e)^ L{e)^ R{e)} forming a diamond graph. Let S = [j^^E^idi^)) 
and V = [Jvevl^}- The set of nodes of is given hjV' = VUVUSU {s,t}. The 
weight function W : V ^ M>o is defined as follows: W{v) = 1 if G \V^ and 
W{v) = \V{G)\+4\E\ otherwise. 

We now describe the gadget construction. Each edge e = (u^v) is replaced by a 
diamond graph g{e) consisting of the edges 

Eig{e)) = {(«, Lie)), (i?(e), v), (L(e), [/(e)), (L(e), D(e)), (f/(e), i?(e)), (D(e), i?(e))} 

used to connect u and v in G^ (See Fig|3](a).) In addition, the new node s (respectively, 
t) is connected to the node U{e^) (respectively, D[e")) for some arbitrary edges e'^e"^ 
for e' 7^ e" . Finally, fix some ordering ei,...,e^ on the edges of E that starts with 
ei = e' and ends with = e" . Now, for every i G {1, ... ,m — 1} connect D{ei) with 
[/(e^+i), creating a "tour" from 5 to t via the gadgets. This completes the description 
of the instance [G'iV' ^ E\ W)^ 5, t). For a path P and set of nodes denote the set of 
neighbors of P's nodes in W by Nw[P] = N[P] n W. We now show that Nv{g)[P] is an 
optimal vertex cover for G iS P is an optimal s — t secluded path in the weighted graph 
G^ We begin by making an immediate observation. 
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Observation 3.2 For every optimal s — t secluded path P , V{P) n V{G) = 0. 

Proof: Assume the contrary, and let P be an optimal secluded path with at least one 
node V G V{G). Since v has a heavy neighbor it holds that Cost(P) > + V + 1. 
Consider next an alternative path P' that goes strictly through the nodes in i.e., follows 
the tour through the gadgets g{e). Since N[P^] C [J^^^ N[w] C S' U V{G)^ we get that 
Cost(P) > 4:E + V > Cost(P'), in contradiction to the optimality of P. | 

Consequently, in the rest of the proof, we consider only secluded paths P that do not 
visit nodes of V{G). Note that every such s — t path is of length 3\E\ and the set of S 
nodes is fully contained in neighbors of this path, i.e., Ns[P] = S. It therefore holds that, 

CostiP) = \S\ + \NviG)[P]\ . (6) 

The main factor determining the quality of an efficient path, among all paths that do 
not visit V{G)^ is the number of path neighbors in V{G)^ i.e., |A/'y((^)[P]|. That is, by 
choosing between the t^-side (respectively, '^-side) in every gadget g{e)^ for e = (u^v)^ the 
resulting path has u (respectively, v) in its neighborhood, i.e., Nv(g)[P]- Overall, there 
are \E\ such decisions to be made. Let VC{G) be an optimal vertex cover for G. We 
show the following. 

Claim 3.1 ql, = 4\E\ + \VC{G)\. 

Proof: We begin by showing that Cost(P) > 4|£'| + |1/(7(G)| for every s — t path P that 
does not go through the nodes of V{G). Let A^* = Nv(g){P)- We now show that A* is a 
legal vertex cover of G, hence \VC{G)\ < |A*| and by Eq. g, Cost(P) > 4.\E\ + \VC{G)\ 
as required. Assume for contradiction that there exists an edge e = (u^v) G E not covered 
by the nodes of A*, i.e., both u^v ^ A*. Consider the gadget g{e). Since P does not 
go through the nodes of V{G)^ it passes through U{e) and D{e). In particular, when 
reaching t/(e), a decision is made between passing through the u side L(e) or through 
the V side R{e). Without loss of generality assume that P passes through I/(e), i.e., 
P = [5, . . . , [/(e), I/(e), D{e)^ . . . , t]. Then, since 2/ is a neighbor of I/(e), it is in A* and 
we end with contradiction. 

Conversely, we now show that for every legal vertex cover (7 of G, there exists Slu s — t 
secluded path P whose cost satisfies Cost(P) < \C\ + AE. The path P is constructed as 
follows. Start from s and traverse the gadgets g{e) in order, from g{ei) to g{em)- Within 
each gadget g{e)^ for e = (u^v) such that {u^L{e)) G E' and {v^R{e)) G E' ^ move from 
[/(e) to D{e) through L(e) ii u ^ C and through R{e) otherwise. One of these choices 
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must hold, since C is a legal vertex cover, so it must contain at least one of the nodes u 
and V. Hence, the path P "pays" for a node v in V{G) only if v e C. Formally, we get 
that A^* C C. Hence, combining with Eq. Q, Cost(P) < + \C\. The claim follows. 
I 

Part (a) of the lemma follows. Finally, we now turn consider Lemma [3.1Kb). It is 



sufficient to show a directed version for the reduction such that Obs. 13.21 still holds. Note 
that the main essence is to prevent any optimal path from visiting the nodes of V{G) 
(and hence to visit every g{e)). In the weighted case, this goal is achieved by adding a 
"heavy" unique neighbor to every node in V{G) (the black nodes in Fig. [3](a)). In the 
directed case, this property is achieved by directing the edges in the following manner. 
Recall the ordering ei, . . . , on the edges which imposes a direction for each edge 
in our construction as follows. We include the arcs (5,[/(ei)), (i^(e^),t), ([/(e^), L(e^)), 
([/(ei),i?(e^)), (L(e^),L)(e^)), {R{ei), D{ei)), and in addition, (L)(e^), [/(e^+i)) for every 
i G {1, . . . , m — 1}. Finally, the edges between L(e^) and R{ei) to the nodes of V{G) are 
directed towards the nodes of V{G) (see Fig. [3](b)). This forces any s — t path in G^ to 



go through strictly the nodes in S. Obs. 3.2 now holds for the directed case, and the rest 



follows as in the weighted case. The lemma follows. | 




Figure 3: Illustration of the reduction from Vertex Cover, (a) The weighted case. Top: 
the graph G with the diamond graph gadgets g{e). The black nodes correspond to the heavy 
neighbor attached to each node v G V{G). Bottom: zoom into a single gadget, (b) The 
directed case. Directionality enforces visiting the gadgets in order, precluding the tour in V{G) 
nodes. 
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3.1 Polynomial-time algorithm for the unweighted undirected 
case. 

In contrast, for unweighted undirected case of bounded-degree graphs, we show that the 
Secluded Path problem is polynomiaL For two subpaths A, P2, define their asymmetric 
difference as 

DiS{P,,P2) = \N[P,]\N[P2]\ . 

Observation 3.3 (a) Diff{Pi, P2) < Diff{Pi, P') for every P' C Pa- 
(b) Cost(Pi o P2) = Cost(Pi) + Dtjf{P2, Pi). 

Proof: Part (a) simply follows as N[P^] C N[P2\. Part (b) follows by definition. | 
We have the following. 

Theorem 3.4 The Secluded Path problem is polynomial on unweighted undirected degree- 
bounded graphs. 

Note that in the previous section we showed that if the graph is either weighted or directed 
then the Secluded Path problem (i.e., the special case of Secluded Steiner Tree with 
two terminals) is NP-hard. In addition, it is noteworthy that the related problem of 
Minimum Labeled Path problem [23l ITl J is NP-hard even for unweighted planar graph 
with max degree 4 (this follows from a straightforward reduction from Vertex Cover). 

We begin with notation and couple of key observations in this context. For a given 
path P, let dp{s^ u) be the distance in edges between s and u in the path P. Recall, that 
A is the maximum degree in graph G. 

Observation 3.5 Let v be two nodes in some optimal s—t path P* that share a common 
neighbor^ i.e., N{u) n N{v) ^ 0. Then dp*{u^ '^) < A + 1. 

Proof: Assume for contradiction that there exists an optimal s — t path P* such that 
N{u) n A^(^) 7^ where u = and v = 1 < i < j for {i - j) > A + 2. RecaU 

that due to the optimality of P*, for every node 2/ in P*, the only neighbors oi u in G 
from among the nodes of V{P*) are the nodes adjacent to u in P* (otherwise the path 
can be shortcut). Let w G N{u) D N{v) be the mutual neighbor of u and v and consider 
the alternative s — t path P obtained from P* by replacing the subpath Q = P[i^ ... ^ j] hy 
the subpath P' = [u^ v]. Let Q~ = P[i + 2, . . . , j — 2] be an length--^' internal subpath 
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of Q where > A — 1. Then since the degree of w is at most A, it foUows that 

Cost(P) < N[V{P) \ V{Q-)] + A - 2 , (7) 

where A — 2 is an upper bound on the number w' s neighbors other than u and v. In 
addition, note that by the optimahty of P* it contains no shortcut and hence V{Q~) n 
{N\V{P) \ V{Q~)]) = (i.e., for node P[i] the only neighbors on the path P are P[i — 1] 
and P[i + 1]). Thus, 

Cost(P*) > N[V{P)\V{Q-)] + \V{Q-)\ 

> N[V{P)\V{Q-)]+A-l>Cost{P), 

where the last inequality follows from Eq. ([7|, contradicting the optimality of P*. The 
observation follows. | 

In other words, the observation says that two vertices on the optimal path at distance 
A + 2 (which is constant for bounded-degree graphs) or more have no common neighbors. 
This key observation is at the heart of our dynamic program, as it enables the necessary 
subproblem independence property. The difficulty is that this observation applies only to 
optimal paths, and so a delicate analysis is needed to justify why the dynamic program 
works. Note that the main difficulty of computing the optimal secluded path P* is that 
the cost function Cost(P*) is not a linear function of path's components as in the related 
DegCost measure. Instead, the residual cost of the ith vertex in the path depends on the 
neighborhood of the length- (i — 1) prefix of P*. This dependency implies that the secluded 
path computation cannot be simply decomposed into independent subtasks. However, in 
contrast to suboptimal s — t paths, the dependency (due to mutual neighbors) between 
the components of an optimal path is limited by the maximum degree A of the graph. 
The limited dependency exhibited by any s — f optimal path facilitates the correctness of 
the dynamic programming approach. Essentially, in the dynamic program, entries that 
correspond to subsolution a of any optimal s — t^ path enjoy the limited dependency, and 
hence the values computed for these entries correspond to the exact cost of an optimal 
path that starts with s and ends with a. In contrast, entries of subpaths a that do not 
participate in any optimal s — t^ path, correspond to an upper bound on the cost of some 
path P that starts with s and ends with a. This is due to the fact that the value of the 
entry is computed under the limited dependency assumption, and thus does not take into 
account the possible double counting of mutual neighbors between distant vertices in the 
path. Therefore, the possibly "falsified" entries cannot compete with the exact values, 
which are guaranteed to be computed for the entries that hold the subpaths of the optimal 
path. This informal intuition is formalized below. 
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For a path P of length £ > A + 1, let Suff (P) = (w£_a, ...,Vi)he the (A + l)-suffix of 

P. 



Lemma 3.6 Let P* be an optimal s — t' path of length > A + 1. Let P* = Pi o P2 
be some partition of P* into two subpaths such that \Pi\ > A + 1. Then Diff(P2, Pi) = 

W(P2,Suff(Pi)). 

Proof: For ease of notation, let £ = |Pi|, = . . . J - {A + 1)], and a = Sufr(Pi). 
By definition, Difr(P2, Pi) = \ (^^[^2] \ N[a]) \ N[F]\. In the same manner, Difl[(P2, (t) = 
I {N[P2] \ N[a]) |. Assume for contradiction that the lemma does, namely, DiS{P2^o') ^ 



DiflF(P2,i^' o a). Then by Obs. Qa) we have that DiS{P2,a) > DiS{P2,P' o a). This 
implies that N[P2] n N[P'] ^ 0. Let u G N\P2\ n N\P\ There are two cases to consider: 
(a) u ^ P\ and (b) u has a neighbor V\ in P' . We handle case (a) by further dividing it 
into two subcases: (al) G P2, i-e., u occurs at least twice in P*, once in P' and once in 
P2, and (a2) u has a neighbor V2 in P2. Note that in both subcases, there exists a shortcut 
of P*, obtained in subcase (al) by cutting the subpath between the two duplicates of u 
and in subcase (a2) by shortcutting from 2/ to '^2- This shortcut results in a strictly lower 
cost path, in contradiction to the optimality of P*. We proceed with case (b). Let V2 G P2 
be such that u G A^[^2]- If ^2 = then clearly the path can be shortcut by going from 
V\ G P' directly to 2i G P2, resulting in a lower cost path, in contradiction to the optimality 
of P\ If V2 ^ u, then rfp*(^i,^2) > A + 2 (since |cr| > A + 1) and A^[^i] n A^(^2) 7^ 0, 



which in contradiction to Obs. 3.5, The Lemma follows. | 



Corollary 3.7 Let P* he an optimal s — f path of length > A + 1. Let P* = Pi o P2 
be some partition of P* into two subpaths such that \Pi\ > A + L Then Cost(P*) = 
Cost{Pi) + Dtff{P2,SuS{Pi)). 

For clarity of representation, we describe a polynomial algorithm for the Secluded Path 
problem, i.e., where S = {s^t} and in addition A < 3, in which case Suff(P) = 
('^^_3, . . . , '^^). The general case of A(G) = 0(1) is immediate by the description for 
the special case of A = 3. The case of Secluded Steiner Tree with a fixed number of 



terminals \S\ = 0(1) is described in Subsection 3.2, 

The algorithm we present is based on dynamic programming. For each 4 < i < n, 
and every length-4 subpath given by a quartet of nodes a C V{G)^ it computes a length-i 
path 7r(cr, i) that starts with s and ends with a (if such exists) and an upper bound /(a, i) 
on the cost of this path, f{(J^i) > Cost(7r((7, i)). These values are computed inductively, 
using the values previously computed for other a's and i — 1. In contrast to the general 
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framework of dynamic programming, the interpretation of the computed values /(a, i) and 
7r((j, i), namely, the relation between the dynamic programming values /(a, i) and 7r(a, i) 
and some "optimal" counterparts is more involved. In general, for arbitrary a and the 
path 7r((7, i) is not guaranteed to be optimal in any sense and neither is its corresponding 
value /(cr, i) (as /(cr, i) > Cost(7r(cr, i))). However, quite interestingly, there is a subset 
of quartets for which a useful characterization of /(cr, i) and 7r((7, i) can be established. 
Specifically, for every 4 < i < n, there is a subclass of quartets ^^*, for which the computed 
values are in fact "optimal", in the sense that for every a G ^^*, the length-i path 7r(a, i) 
is of minimal cost among all other length-i paths that start with s and end with a. We 
call such a path a semi-optimal path, since it is optimal only restricted to the length and 
specific suffix requirements. It turns out that for the special class of quartets ^^*, every 
semi-optimal path of a G is also a prefix of some optimal s — f path. This property 
allows one to apply Cor. |3.7[ which constitutes the key ingredient in our technique. In 
particular, it allows us to establish that /(cr, i) = Cost(7r(a, z)). This is sufficient for 
our purposes since the set ljr=i contains any quartet that occurs in some optimal 
secluded s — t path P*. Specifically, for every s — t optimal path P*, it holds that the 
quartet Suff(P*) = - 3,i] satisfies Suff(P*) G ^^*. The correctness of the dynamic 
programming is established by the fact that the values computed for quartets that occur 
in optimal paths in fact correspond to optimal values as required (despite the fact the 
these values are "useless" for other quartets). 



3.1.1 The algorithm 

If the shortest path between s and t is less than 3, then the optimal Secluded Path can 
be found by an exhaustive search, so we assume throughout that dcis^t) > 3. Let be 
the set of all length-4 subpaths a in G. That is, for every a G we have V{a) C V{G) 
and (cr[i], a[i + 1]) G E{G) for every i G [1, 3]. For a G define the collection of shifted 
successor of G as Next{a) = {(cr[2], . . . ,cr[4],t^) | u G (A^[cr[4]] \ a)}. For every pair (a, i), 
where a G and i G {!,..., n}, the algorithm computes a value f{cr^i) and length-z 
path 7r((j, ending with a (i.e., 7r(cr, — 3, = a). These values are computed 

inductively. For i = 4, let 7r(cr, 4) = a and 

{Cost(a), if = s 
oo, otherwise 
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Once the algorithm has computed /{cr^j) for every 4 < j < i — 1 and every a G in 
step i G {5,n} it computes 

/(a, i) = min{/(a^ i - 1) + DiS{a, a)\ae Next{a')}. (8) 

Note that 

Difr(a,aO = Diff(a[4],aO = |A^[^[4]] \ N[a']\. (9) 

Let a' G such that a G Next{(j') and a' achieves the minimum value in Eq. ([s]). Then 
define Pred{a) = a' and let 

7r(a, = 7r{Pred{a),i - 1) o a[4]. (10) 

Let g*^ = min{/(cr, | i G {4, . . . , n}, Last{7r{a, i)) = t}, and set P* = 7r(cr*, i*) for a*, 
such that f{a''^i'') = q^^. Note that there are at most 0{n^ • n) entries f{a^i)^ each 
computed in constant time, and so the overall running time is O(n^). 

Analysis. Throughout this section we make use of two notions of optimality for secluded 
paths. The two notions are parameterized by a G and by an index i G {1, . . . , n}. Let 
P(cr, i) be the set of length-i paths P starting at P[l] = s and ending with Suff (P) = a for 
P G V{a^i). Denote the minimum cost of a path in V{a^i) by ^^{cf^i) = min{Cost(P) | 
P G V{a^i)} and denote the set of paths attaining this minimum cost by Qsemii^^i) = 
{P G P(cr, i) I Cost(P) = g*(cr, i)}. We refer to paths in Qsemi{(^^ i) as semi-optimal paths, 
since they are optimal only among length-i paths ending with a. Note that in general, a 
semi-optimal path P G Qsemii^^-ii) is not necessarily an optimal secluded path from s to 



nor is it a prefix of an optimal secluded path from s to any f , and thus Cor. 3.7 
cannot be applied. However, we next argue that there exists a subset of quartets a* G 
for which every path P G Qsemi{(^)i) is a prefix of an optimal s — t' secluded path, and 



since Obs. |3.5| holds for any subpath of some optimal s — t' path, we can apply it for this 
subset of sequences. Let Q^, = {P G Vg^t' \ Cost(P) = q'*^/} be the set of all optimal s — f 
paths for f G V{G)\{s}. A path P that is a prefix of some optimal s — f path is hereafter 
referred to as a pref-optimal path. Let Q*(cr, i) = {P G IJf^Gy(G')\{s} ^t' I i] = a}. 

That is, Q*(cr, i) is the set of all optimal secluded paths in which the subpath in positions 
{i — 3) to i is equal to a. Note that in contrast to the set Qsemi{(^^ i)^ which is nonempty 
for every a G and might contain suboptimal secluded paths, the set Q^'^a^i) might be 
empty for many choices of a, but every path P G Q*(cr, i) is indeed optimal. 

Define the subset of subpaths for which i) is nonempty, for z G {4, . . . , n}, as 

^; = {ae^ I e*(a,z)7^0}. 
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We next establish some useful properties for a G ^^*. Let Q^(cr, i) correspond to the set of 
paths in i) but truncated at position i, that is, Q^{a^i) = | P G i)}. 

Clearly, every path P in Q^{a^i) is of length i. Moreover, since Q*(cr, i)} contains op- 
timal paths, Q^{a^i) consists of pref-optimal paths. It is important to understand the 
distinction between the set Q^{a^i) and the set Qsemi{(^^i)- Both sets contain length-z 
paths that start with s and end with a. However, while Pi G Q^(cr, i) (if it exists) corre- 
sponds to a prefix of some s — t' optimal path, a path P2 G Qsemi{(^i i) is not necessarily a 
prefix of some optimal path; it is only optimal among paths of length i ending with a. In 



particular, this difference implies that Cor. 3.7 can be safely applied to paths in Q (a, i). 



We first provide a general observation regarding the computed values f{(J^i) and 
7r(a,i). 

Observation 3.8 For every a G andi G {1, . . . ,n}, z/P* G Qsemi{<7,i) thenCost{P'') < 
Cost(7r(a, i)) < f{a^i). 

Proof: The left inequality follows immediately, since both 7r(a, and P* are in V{a^i) 
and P* is a semi-optimal path, i.e., it has the lowest cost among these paths. The right 
inequality is proved by induction on i. for i = 4, the claim holds trivially. Assume it 
holds for every a and every j < i — 1, and consider a fixed a and i. Let = Pred{a). 
Then 



Cost(7r(cr, i)) = Cost(7r(cr', i — 1) o cr[ 

= Cost(7r(a,i-l)) + Difr(a[4],7r(a,i-l)) (11) 

< /(a^^-l) + Difr(a[4],aO (12) 

= /(a^^-l) + Difr(a,aO (13) 

= /(^,0 , (14) 

where Ineq. (11) follows from Obs. |3.3| (b), Ineq. (12) follows from the inductive assump- 
tion, Eq. (13) follows from Eq. ^ and Eq. (11) follows from Eq. ([s]). The observation 
follows. I 

It turns out that for the set of special quartets that occur on some optimal s — t path, 
a stronger characterization of 7r(a, i) and f{(J^i) can be established. 

Lemma 3.9 For every i and a G ^ 
(a) 7r(cr, z) G Qsemi{cr,i), GLnd 
(h) /(a,z) =Cost(7r(a,z)). 
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Let P* be some optimal secluded path and let a = Suff(P*). Since a G ^^|p*|, by Lemma 



3.9 we get that the computed path 7r(a, |P*|) is indeed an optimal s — t path of cost 



/(a, |P*|). Our remaining goal is therefore to establish Lemma 3.9 



Proof Sketch. We prove that the inequalities of Obs. |3.8| are in fact equalities for every 
4: < i < n and every a G ^^*. The proof, by induction on is not immediate and requires 
several properties to be established and to come into play together. To begin, assume that 



both parts of Lemma 3.9 hold for every a G for every 4 < i < £ — 1 and consider some 
a G In addition to the inductive assumption, we assume that Pred{a) G (which 
would be established later on). Given these assumptions, for part (a) of the lemma we 
show that 7r((j, £) = 7r(Prerf(cr), ^ — 1) ocr[4] G Qsemi{(^^ Next, to show that part (b) also 
holds, i.e., that /(a, i) = Cost(7r(a, ^)), we would like to show that 7r(cr, £) is a pref-optimal 
path and thus Cor. |3.7| can be applied. Our argumentation can be sketched as follows. We 
prove that for every a G ^^*(^) it holds that (a) every semi-optimal path P G Qsemi{^^^) 
is pref-optimal, and (b) there is a set of quartets denoted by Parent{a^£) C for 
which the following holds for every G Parent{a^£): (bl) P^ = Poa[4\ G Qsemi{(^i^) for 
every P G Qsem^(<^^^— 1), and hence P^ is pref-optimal, and (b2) Pred{a) G Parent{a^£) 
and thus the inductive hypothesis can be established. 



Detailed analysis. We proceed by showing that every semi-optimal length-i path P G 
Qsemii^j i) for (J G ^^*, i G {4, . . . , n} is a pref-optimal path. 

Lemma 3.10 For every a G Q^{cr,i) = Qsemi{o-,i)' 

Proof: We first show that Q^{a^i) C Qsemi{(^^i)- To do that, we consider a path Pi G 
Q*(a,i) for which G and show that Cost(Pi[l, i]) < Thus 

G Qsemii^^i)- Let £ = I Pi I and f = Last (Pi), thus Pi is an optimal s — t' 
path. Assume for contradiction that Cost(Pi[l, i]) > q''{a^i). Let P2 G Qsemi{(^)i)) i-^., 
Cost(P2) = q'*(cr, i). Consider an alternative s — t' path P3 = P2 o Pi[i + 1, Note that 
P3 is indeed a legal s — t' path since P2 [i] = Pi [i] . We now compute its cost and compare 
it to that of Pi. 

Cost(P3) = C0St(P2) + DiflF(Pi[i + l,^],P2) (15) 

< Cost(P2) + DiflF(Pi[i + l,^],a) (16) 

< Cost(Pi[l,i]) + Difl[(Pi[z + l,^],a) (17) 
= Cost(Pi), (18) 
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where Ineq. (15) and (16) follow by parts (b) and (a) of Obs. 3.3 and Ineq. (17) is by the 



contradiction assumption. Eq. (18) follows from the fact that Pi is an optimal s — f path 
and thus Cor. 3.7 can be safely applied. We therefore get that Cost(P3) < Cost(Pi), in 



contradiction to the optimality of Pi. 
Conversely, we now prove that Qg 



{cr,i) C Q^{a,i). Let Pi G Qsemi{cr^i) and 



P2 e Let e = IP2I and f = Last(P2). Consider the path P3 = Pi o P^li + IJ] 

obtained by replacing the length-i prefix of P2 with Pi. We show that P3 has the same 
cost, Cost(P3) = Cost(P2), hence P3 is also an optimal s — path. This would imply 
that P3 G Q*((7, i) and Pi G Q^{a^i)^ establishing the lemma. 



Cost(P3) = Cost(Pi oP2[^ + 1,^]) = Cost(Pi) + Difl[(P2[^ + 1,^], A) 

< Cost(Pi) + Diff(P2[^ + l,^],^) 

< Cost(P2[l,^]) + Difl[(P2[^ + l,^],^) 

= C0St(P2) , 



(19) 
(20) 



where Ineq. (19) follows from the fact that both Pi and P2[l,^] are in P(cr, i), i.e., both 



are length-i paths starting at s and ending with a and Pi is semi-optimal. Ineq. (20) 



follows from the fact that P2 is an optimal s — f path, thus Cor. |3.7| can be safely applied. 
The fact that P2 is optimal necessitates equality. The lemma follows. | 



For a G and i G {5, . . . , n}, define 

Parent{a,i) = {a ={u,a[l],...,a[3]) \ 3P G Q*(a, z), P[i - 4, 



u o 



a}. 



That is, a' G Parent{a^ i) iff there exist some t' G V{G) \ {s} and an optimal s — t' path 
P* such that — 4, i] = o a. Note that by the definition of ^^*, there exists at 
least one pref-optimal path P of length i that ends with a. In addition, since i > 5 there 
exists at least one a' G Parent{a^ i) corresponding to = P[i — 4, i — 1]. In other words. 



Parent{a^ i) ^ for every i > 5 and a G ^^*. Lemma 3.10 implies that the following holds 
for every a G ^^*. 

Observation 3.11 If a e then 

(a) for every a' G Parent{a^i) there exists a semi-optimal path P* G Qsemii^^^i)^ which 
is also a pref-optimal path, such that P*[i — 4, i] = a' o cr[4]. 
(h) Parent{a,i) C ^^*_^. 

We first provide an auxiliary claim that holds for every i G {1, . . . , n}. Let a, be such 
that (a) a G and (b) a' G Parent{a^i). 
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Claim 3.2 For every P G Qsemi{(^', i-l), Poa[A]e Qsemi{(^, i) 



Proof: Let P e Qsemii^', i) such that a' = P[i — 4, i — Note that since a' e Parent{a, i), 
by Obs. 3.11 a), such a P is guaranteed to exist. Let P G Qsemi{o'',i — 1) and let 
P" = Po cr[4]. We now prove that Cost(P") < Cost(P) and therefore P" G Qsemi{(^,i)- 



Cost(P") 



= Cost(P o (j[4]) = Cost(P) + Diff(cT[4], P) 

< Cost(P) + Diff((j[4],(j') 

< Cost(P[l,i- 1]) + Diff(a[4],a') 
= Cost(P) , 



(21) 
(22) 
(23) 



where Ineq. (|2T|) follows from Obs. |3.3|(a), Ineq. (22) follows from the fact that both 



i — 1], P G V{a\ ^ ~ 1) P, being semi-optimal, has the minimal cost in this set. 
Eq. ([23]) follows from the fact that P G Qsemi{o'^ i) and since a G ^^*, Lemma 3. 10| implies 
that P is a pref-optimal, i.e., a prefix of some optimal secluded path, i.e., P G Q^{(J^i). 
Therefore, Cor. 



3.7 



can be applied. | 



We now turn to proving Lemma 3.9, showing that for every and every a G ^^*, the 



inequalities of Obs. 3.8 become equalities. 



Proof of Lemma 3.9 : The lemma is proven by induction on i. For i = 4, both claims 
clearly hold. Assume the claims hold for i < i — 1 and consider i. Let = Pred{a). To 
be able to apply the inductive hypothesis to a', we first show the following. 



Claim 3.3 a' G Parent{a,i). 

Proof: Note that since a G it holds that Parent{a,^ 



by Obs. 3.11(b). Let a* G Parent{a,£), thus a* G By Lemma 



7^ 0, and also Parent{a^£) C 



3.10 



Qsemi{c, and Q-^(a*, £ — 1) = Qsemii<^* , ^—^)- Let us pick three arbitrary representatives 

P*{a*,e- 1) G Qsemi{(T*,i- 1), P*{a',e- 1) G aemi(^',^ - 1), P*{(T,i) G Q.em^(fT,^). 

— 1) and P*{a,i) are pref-optimal paths and 



RecaU that due to Lemma 3.10, P*(a 



therefore Cor. 3.7 can be safely applied for them. We first show that 



Cost(P*(a,£)) = Cost(P*((j*,£- 1)) + Diff((j,a*). 



To see this, let P' 



(24) 
In 



P*{a*,£ - 1) o (t[4]. Then by Claim [S^ P' G Q, 
addition, by Lemma 3.10, P' G Q^(cr, £) and therefore Cor. 3.7 can be safely applied, 
so Eq. (|24[ is established. We next show that P" = P*{a',i- 1) o cr[4] G QsemiicrJ)- 
Clearly, P" G P(ct,£), so it suffices to show that Cost(P") < Cost{P*{aJ)) as P*ia,i) is 



26 



semi-optimal. Indeed, 

Cost(P") = Cost{P*{a',£-l)) + DiS{a[4\,P*{a',i-l)) 

< Cost{P*{a',i-l)) + DiS{a[4],a') 

< f{a',£-l) + DiS{a,a') 

< f{a*,i-l) + DiS{a,a*) 

= Cost(7r((j*,£- 1)) + Diff((j,a*) 

= Cost{P*{a*,e - 1)) + Diff((T, a*) = Cost(P*(a, i)) 



(25) 
(26) 
(27) 
(28) 
(29) 



where Ineq. (25) follows from Obs. |3.3K a) and Ineq. ([26]) follows from Lemma 3.8| and 
Eq. Ineq. (27) follows from the fact that a G Next{(j') nNext{a'') and attains the 
minimum value of Eq. ([s]). Eq. (28) follows from part (b) of the inductive assumption 
for ^ — 1. In Eq. (29) the first equality follows from the fact that both 7r(a*,^ — 1) and 
1) are in 2,emz(^*, ^ - 1), and thus Cost(P*(a*, ^ - 1)) = Cost(7r(a*, ^ - 1)) = 
g*((7*,^ — 1), and the second equality follows from Eq. (24). 

We therefore get that P^^ G Qsemi{(^^ ^) and since a G by Lemma 3.10 it holds that 
g' G Parent[G^i\ as required. 



1) oa[4] is also a pref-optimal path. This implies that 



I 



By Claim 3.3 and Obs. 3.11 [^b), G It follows that the inductive hypothesis 

can be applied to a' . By part (a) of the inductive assumption for ^ — 1, it holds that 
7r(cr',^ ~ 1) ^ Qseraiip' A ~ l)- Finally, by Claim [3^ we get that the path 7r(a, ^) chosen 
by the algorithm satisfies 7r((7, ^) = 7r((7', ^ — 1) o cr[4] G Qsemi{^^^^\ which establishes (a). 
We now consider (b). Note that 



j{o,£) = f{a',i-l) + DiS{a,a') 

= Cost(7r(a',£- 1)) +Diff(a,(j') 
= Cost(7r((T',£- 1) o(t[4]) , 



(30) 
(31) 



where Eq. (30) follows from the inductive hypothesis for i—1 and Eq. (31 ) follows from the 



fact that 7r(cr', ^— 1) ocr[4] G Q^(cr, £) and thus Cor. 3.7 can be applied. This completes the 



proof of Lemma [3^ We are now ready to complete the correctness proof of the algorithm. 
Let P* be an optimal s — t path of length i. Then — 3,^] G Therefore, by Lemma 
3^ P*[l,^] G Qsemi{P"[^-SJ]J)^ and so - 3, ^], ^) = Cost(P*), and by Lemma 



3.8, the cost of any other a such that t ^ and every i, satisfies f{(J^i) > 7r(cr, i) > t. 



Thm. 13.41 is established. 



Corollary 3.12 The Secluded Path problem can be solved in 0{n'^^^) time. 
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This follows since there are 0{n'^^^) distinct length-(A + 1) subpaths, and the length of 
the path is bounded by n. Hence, overall there are 0(n^^^) entries and the value for each 
entry is computed in 0(A) time. For the Secluded Steiner Tree problem, we show the 
following. 

Theorem 3.13 On unweighted undirected degree-bounded graphs^ we have the following: 
(a) for arbitrary k, the Secluded Steiner Tree problem is NP-hard; (b) for k = 0(1); 
the Secluded Steiner Tree problem is polynomial. 

Proof: We start with (a). We prove this by reduction from Vertex Cover. By [lOj, we 
know that Vertex Cover is NP-complete even for planar graphs with maximum degree 
3. Given an instance of the Vertex Cover problem consists of a bounded-degree graph 
G = (y, £"), we transform it into an instance of the Secluded Steiner Tree problem that 
consists of a bounded-degree graph G and a set of terminal S. Let 5 be a complete binary 
tree with n leaves L = {^i, . . . ^in}- Consider some ordering on V{G) and connect £i to 
Vi^ for every i G {1, . . . , n}. In addition, for each vertex Vi G V{G)^ add a single neighbor 
Vi in G. Next, for each edge e = {vi^Vj) G E{G)^ add a vertex v'^. Every edge e G E{G) 
is replaced by the two edges {v'^^ Vi) and ('U^, Vj) in G. In sum, the graph G is defined by 

V{G) = V{G)UV{B)U U {v,}U U {<}, 

Vi^V{G) eeE(G) 

and 

n 

E{d) = E{B)u[j{{i,,v,),{v,,v,)}U U {{v',,v,),{v',,v,)}. 

1=1 eeE{G) 

Note that the maximum degree A(0) in G is at most A(0) + 1. Hence G is a bounded- 
degree graph as well. The terminal set S = [JeeE{G)We} ^ ^(^) includes the v'^ nodes 
and the nodes of the binary tree V{B). We now show that G has vertex cover of size k iff 
there exists a secluded Steiner tree T in G of cost \S\ + \ V{G) \ + k. Recall that VC{G) 
is some optimal vertex cover in G and q*{S) is the cost of the optimal secluded Steiner 
tree in G. We now show that 

q*iS) = \S\ + \V{G)\ + \VC{G)\ . 

We begin by showing that q*{S) < |<S| + + |yC(G)|. Given a vertex cover C' for G, 

we show that there exists a Steiner tree T C G of cost Cost(T) < \S\ + |V'(G)| + \C'\. The 
tree T is given by V{T) = S\JC' and E{T) = E{B) U Vi) \ e C'} U {«, Vi),e = 
ivi,Vj) G E{G),Vi G VC{G)}. Since C' is a legal vertex cover in G, it follows that every 
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terminal v'^ has at least one neighbor Vi (corresponding to the endpoints of e in G) that 
connects it to the tree T. Hence, T is a valid Steiner tree. We now analyze its cost. The 
cost of T consists of the set Vi = S\J V{G) of the terminals and their neighbors, and the 
set V2 = {v^ I G C}. Therefore, Cost(T) = l^il + \V2\ = \S\ + \V{G)\ + \C'\ and thus 
< \S\ + \V{G)\ + \VC{G)\ as required. 

It remains to show that \VC{G)\ < - {\S\ + \V{G)\). Given a Steiner tree T in 

G, we show how to obtain a legal vertex cover C such that \C\ < Cost(T) — {\S\ + 
Let C = V{T)nV{G). First, note that C is a vertex cover. Assume for contradiction that 
it is not, and let e = (vi^Vj) be a non-covered edge by C. Since neither Vi nor vj in (7, 
we get that v^^ is disconnected in T, in contradiction to the fact that T is a Steiner tree. 
Finally, observe that Cost(r) = \S\ + \V{G)\ + \V{G)nV{T)\. Since C = \V{G)nV{T)\, 
part (a) of the theorem is established. We now turn to prove Thm. 3.4K b) and show 
that there exists a polynomial-time algorithm if the number of terminals is bounded by a 
constant. 



3.2 Polynomial-time algorithm for Secluded Steiner Tree with 
\S\=Oil). 

For simplicity we present the algorithm for maximum degree three (and note that the 
proof works for any fixed bounded degree). Let k = \S\. We note that the main difference 
with Secluded Steiner Tree problem is that here we also need to take into account the 
''skeleton" of the tree. We iterate over possible skeletons of trees, where by skeleton of a 
tree we mean the following. Consider some tree T and contract every path P G S(T) such 
that all nodes in P but the endpoints are of degree 2 and are not terminal into a single 
edge. The resulting graph is the skeleton of the tree T. Note that it is enough to look on 
trees T such that all leaves are terminals (as otherwise we can take a subtree of the tree 
spanning all terminals). Note that the skeleton may contain at most k nodes that are 
not terminals. Given the (at most k) nodes that are not terminals, an upper bound on 
the number of different skeletons is the number of different trees with at most 2k nodes, 
which is 0((2/c)^^~^). There are 0{n^) options to choose the nonterminal nodes in the 
skeleton. We thus get that there are at most 0(n^(2/c)^^~^) different skeletons. 

A subtree Sj^ is an extended skeleton of the skeleton S if it obtained by replacing each 
edge Ci = (x^, i/i) in the skeleton S with a path Pi of the following form. Either the path 
Pi is a path of G of length at most 9 (and contains at most 8 internal nodes), or the prefix 
of the path Pi is a subpath of length 4 in G from Xi to some node Zi and the suffix of Pi 
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is a subpath of length 4 in G from some node Wi to y^, the nodes Zi and Wi are connected 
by an imaginary edge (which wiU be replaced later by a path from G). 

The algorithm for finding the tree with the lowest cost is as follows. Iterate over all 
possible skeletons S and for each skeleton S iterate over all possible extended skeleton 
5^+. For each do the following. Let V{S^) be the set of nodes in 5^+. For each 
imaginary edge (x, y) in 5^+, consider the graph y) that is obtained by removing all 
nodes V{S^) \ {x^y} from G. Find the lowest cost Secluded Path P[x^y) from x to y 
in G\x^ y). For each imaginary edge (x, y) in 5^+, replace the edge with the path y). 
Let T{S^) be the tree obtained through this process. Finally, return the tree T{S^) with 
the lowest cost. Let Taig be the returned tree. 

We call nodes in a tree T splitting nodes if their degree in T is greater than 2. We 
call a node special in T if it is either a splitting node or a terminal. Consider the optimal 
tree T* and let aS* be its skeleton and S"^ be its extended skeleton. We claim that 
Cost(T(5'^)) = Cost(T*), this will imply the correctness of the algorithm. 

As in the Secluded Path problem we may restrict ourselves only to trees such that all 
nodes that their parent are not special do not have edges to nodes on the tree that are 
not in their subtree, similarly, every node that none of whose children is special has no 
edges to nodes in its subtree (otherwise we can find a diflFerent tree with a subset of the 
nodes of T*). In other words, all nodes in T* that are neither special nor have a neighbor 
in T that is special have no edges in G to other nodes in T* that are not adjacent to them 
in T*. 



The following observation is an extension of Obs. 3.5 and is crucial for the algorithm. 



Observation 3.14 Consider an edge ei in the skeleton and their corresponding path 
Pi = (^1, ^n) ^ S(r*). The node Zi for 6 < i < ri — 6 cannot have a common neighbor 
with any other node z G T* \ Pi . 

Lemma 3.15 Cost(r(5;)) = Cost(r*) 



Proof: The proof follows from Obs. 3.14, Note that in the optimal solution T*, N[S^] C 
A^[T*]. Consider an imaginary edge (x^y) in S'^. Let P''{x^y) be the subpath from x to 
y in T*. Note that the subpath P*(x',^') are disjoint for all imaginary edges {x' ^y') in 
SI . Let c*(x^2/0 = |iV[P*(x^2/0] \iV[5;]|. Note that Cost(r*) = E(.^^o ^*(^'' ?^') + 
\N[Sl]\. Note also that \N[P{x,y),G'{x,y)]\ < c*(x^2/0• We get that Cost(r(5;)) < 
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E(.,,) \N[P{x,y),G\x,y)]\ + \N[Sl]\ < Cost(T*). | 

4 Secluded Connectivity for Specific Graph Families 

Bounded-treewidth graphs. For a graph G{V^ let TW{G) denote the treewidth of 
G. In this section, we consider graphs G of constant treewidth, i.e. TW{G) = 0(1). A 
separation of a graph G = (V^ E) is a triplet (A, 5, S) where A\J B \J S = V ^ An B <:i S 
and [u^ v) ^ E for every u G A \ S and v G B \ S. The set S then separates A \ S' and 
B\S in G. The concept of treewidth was introduced by Robertson and Seymour [22j using 
tree-decompositions. (See [21 [3] for an in-depth introduction to this topic.) For a graph 
G = {V^ E)^ a tree decomposition is a pair (T, X) consisting of a tree T = (/, A) and a 
collection X = {Xi}i^j of vertex subsets (called bags) with the following properties: (Tl) 
Every vertex v G V{G) is contained in the least one bag Xi. For every edge (u^v) G E 
there is at least one bag Xi containing both vertices v. (T2) For every vertex v G V{G)^ 
the nodes i of T with v G Xi form a subtree of T- 

To avoid confusion, the elements of / are referred to as nodes and the elements of 
V{G) are referred to as vertices. In addition, we may informally refer to the node Xi by 
its index i. Let X^ be the set containing the vertices of node i itself, and the vertices in 
the bags of its descendants in T- Let Ui (respectively, U^) be the set of terminals in Xi 
(respectively, X^). Let u) be some upper bound on TW{G) such that \Xi\ < — 1 for 
every i G /. Let be the set of terminals in X^. We show the following. 

Theorem 4.1 The Secluded Steiner Tree problem (and hence also the Secluded Path prob- 
lem) can be solved in linear time for graphs with fixed treewidth. In addition, given the 
tree decomposition of G, the Secluded Steiner Tree problem is solvable in 0{n^) if 
TW{G) = 0(logn/ loglogn). This holds even for weighted and directed graphs. 

If the tree decomposition is not known, then for fixed treewidth graphs it can be computed 
in linear time [Ij. For simplicity, we consider the unweighted undirected case. The 
weighted and directed cases are considered by the end of the analysis. Given a tree 
T in G and a separation (A^B^S) in G, let Fa = T{A) (respectively, Fb = T{B)) 
be the forest obtained by considering the edges of T restricted to A (respectively, B) 
nodes. Let Difr(A,5) = A^[r] n{A\S) and Difr(5,5) = A^[r] n{B\ S). Finally, let 
Cost(r, S) = N[T] n S. Since (A, B, S) is a separation, it follows that 

cost(r) = Difr(^, s) + Difl[(5, s) + cost(r, s). (32) 
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Such decomposition of the cost function Cost(G') holds for any connectivity construct 
C G. Indeed, the correctness of the dynamic programming apphed on the tree- 
decomposition of G is estabhshed due to this cost independency property of Eq. (32). 



4.0.1 Algorithm 

The Secluded Steiner Tree algorithm is an extension of the algorithm of [6] for comput- 
ing an optimal Steiner Tree in a bounded-treewidth graph. The input to the algorithm 
is a connected graph G = (1/, a set of terminals S and a tree decomposition T = (/, A) 
of G. For ease of analysis, we assume that the tree decomposition T is nice in the sense 
that every node i G / has at most two children ii^i2 ^ I in the tree decomposition T (as 
in [6J). 

Initially choose any root node r G / that contains at least one terminal, i.e. XrHS ^ 0, 
and direct the tree T such that r is the root. For each i ^ I we maintain a table tb{i). 
Each entry /^^ G tb{i) in the table represents a configuration, implicitly corresponding to 
a subsolution. Such a subsolution is a forest F C G that when imposed on the nodes of 
X^^ resulting in F{X^)^ spans the terminals in X^^ and F{X^) may be extended 
into a tree that spans all the terminals. Specifically, the configurations define a collection 
of disjoint subsets of the vertices of Xi and a collection of edges between vertices in X^. In 
addition, it specifies the vertices v G X^ that are neighbors of the forest F (note that for 
such 'U, its neighbor in F is not necessarily in X^; it might be in an ancestor of X^ in T) . 
We follow the representation scheme of [6j where each subsolution is represented by a color 
assignment /^^ to the vertices of X^, and extend it by adding an additional color uj that 
represents the status of being a neighbor. We refer to this coloring as a configuration. The 
interpretation of the colors is representing connected components in F{X^) (the forest F 
restricted to the vertices of X^) and vertices in Xi that have neighbors in F, i.e., vertices 
u G Xi with the same color /^^(2i) = c G [1, cj — 1] belong to the same connected component 
in the subsolution forest F{X^). The vertices u such that tvi{u) = u) do not participate in 
the forest but have at least one neighbor in F. Finally, vertices u with i^i{u) = neither 
belong to the subsolution represented by /^^ nor have a neighbor that belongs to it. The 
idea behind this representation is the following. We assume that there is a Steiner tree 
T in G. The forest T{X^) is obtained by restricting the 1-neighborhood of the tree T to 
the vertices in X^. This restriction of N[T] D X^ is encoded by the configuration /^^. In 
particular, for a Steiner tree T, there is a unique configuration /^^ for each bag Xi that 
is consistent with T (as formally defined later). Overall, each table Xi has at most B^^ 
entries, for /c' = + 1, where = 0{k^\) is the Bell-number [6j, corresponding to the 



32 



number of partitions of at most uj elements into + 1 classes. 

Legal Configuration. Let A{ni) = {u ^ Xi \ ni{u) > 0} denote the set of active vertices 
in the configuration /^^. A configuration /^^ is legal if it satisfies (LI) every terminal u ^ Ui 
is assigned a color in the range tvi{u) G and (L2) N[A{tvi)] H Xi C A^tvi). 

The algorithm proceeds by computing, for each legal configuration /^^ G tb{i)^ a value 
Val{Ki) that represents the minimum number of vertices in X^ that belongs to the neigh- 
borhood of a forest F that respects the configuration. These values are computed bottom- 
up. Before we proceed, we first define the notion of compatible configurations. 

Compatible configurations. Let i be parent of j in the tree T- Then /^^ G tb{i) is 
compatible with i^j G tb{j)^ denoted as /^^ ^ K.j if the following four properties hold: 
(Ql) K^i and k^j agree on their mutual vertices X^ = XiOXj. For any u G X', K.i{u) = iff 
K.j{u) = uj. In addition, for any pair u^v ^ X' that are in the same connected component, 
i^j{u) = tvj{v) G [l,cc; — 1], it holds that u and v are connected in tvi as well. (Note 
that some components in the child j might be connected in the parent i, and thus the 
converse does not hold.) (Q2) If i^i connects two connected components of /^j, then this is 
supported by the graph G edges (i.e., the the edges required for the connectivity exist). 
(Q3) Any terminal of Xj is connected to some connected component in the forest repre- 
sented by hii. Hence, /^^ contains at most \Xi\ connected components and for each terminal 
u G Xj there exists some v G XiD Xj such that i^j{v) = nj{u) and ni{u) G [1,cl; — 1]. 
(Q4) N[A{Kj)]nX,(ZA{K,). 

The value Val^Ki) of the coloring Ki G tb{i) is computed bottom-up. Let Xi be a leaf. 
Then for every legal G tb{i)^ set Val^Ki) = \A{i<ii) \ and Val{tvi) = oc if the configuration 
is invalid. Consider a non-leaf node Xi with children Xi^ and Xi^. Let = (z^^, z^^^, K.i^) 
be a triple of compatible configurations, such that /^^^ G tb{ii) and K.i^ G ^6(^2)- Define 

Val{'i^) = Val{K.i^,K.i) + Val{i^i^, i^i) + \A{i^i)\ , (33) 

where, for j G {^,^2} 

Val{Kj,Ki) = Val{i^j) - \A{i^i) n A{i^j)\ . (34) 



Since {X^^ X^^ Xi) is a separation in the graph G{X^)^ the correctness of Eq. (33) follows 
from Eq. ([32]). Finally, for each entry /^^ G tb{i)^ maintain the minimum cost over any set 
of compatible triples. Define, 

Val{tvi) = min {Val^'i/) \ ki-^ G t6(ii), i^i^ G ^6(^2), f^h ^ ^12 ^ • (35) 
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The optimal solution value for the whole graph can be found in the root bag of T, 
identifying a cheapest solution where all vertices with color 7^ are contained in the same 
connected component (i.e., have the same color). Computing the optimal solution (which 
is sufficiently represented by the set of nodes in the final tree) is possible by backtracking 
or by storing the set of edges for each row and each bag. 

4.0.2 Analysis 

For a given configuration entry Ki G tb{i)^ the forest F agrees with if the following four 
properties hold: 

(Zl) F(X^) has C connected components where C — \{i^{u) G [l^cu — l\^u G Xi}\ is the 
number of distinct colors in the coloring 

(Z2) All vertices u ^ Xi of the same color, Ki{u) are in the same connected component in 
(Z3) The Xi neighbors of the forest are active, N[F] H Xi = A{i<ii). 

(Z4) The terminals are connected in F{X^) in the following manner. Any terminal 
u G \ Xi is in non-singleton component in F. Specifically, it is connected to some 
node V ^ Xi. In addition, the terminals of f/^^ n A^, are in some connected component in 
F. 

For a legal configuration /^^ define the family of all forests in G that agree with /^^ as 

7'(/^.) = {F C G I F agree with i^i} (36) 

Let 

Val{F,Xi)^\N[F]nX^\ (37) 
and Val^'^tvi) = min{V al {F ^ Xi) \ F G J-'{i^i)} be the minimum value of any forest F G 

Lemma 4.2 Val{hZi) = ya/*(/^^). 

Proof: This can be shown by a straightforward inductive proof on the decomposition 
tree. The base cases are leaf nodes where the hypothesis clearly holds. Consider i > 1 
and assume the induction assumption holds for all descendants of bag Xi. Let F* G J^{i^i) 
be the forest that attains the minimum value in the family J-'{i^i)^ i.e., l/a/(F*, A^) = 
Val*{tvi). Let Ai,A2 be the children of Xi in T and let /^i G t6(l),^2 ^ tb{2) be the 
two compatible entries such that F* G J^{i^i) n J-'(/^2), hence A[F*] n Xj — A{Kj) for 
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j G {1, 2, i}. For a forest F G J~'{i^j) and j G {1, 2, i}, define 



Diff(F,X„XO = \N[F] n (X+ 



(38) 



Since (Xi,X2,Xi) is a separation in G{X^) and by Property (Q4) we have the 
foUowing. 

Claim 4.1 Diff{F,Xj,Xi) = Val{F,Xj) - \A{Ki) n A{Kj)\ for every F e T{Kj), where 

je{l,2}. 



In addition, since {X^ , X2 , Xi) is a separation in G{X^) by Eq. (32), it foUows that 

Val{F*,Xi) = DiS{F*,X,,X,) + Diff(F*, X2, X,) + \A{k,)\ . (39) 

For j G {1, 2}, let F* G J'i^j) be the forest that attains the minimum value of Val{F*, Kj) 
in their family, i.e., Var{K,j) = Val{F* , kj). By induction assumption, Val{K,j) = 
Val{F*, Kj). By Eq. (Isl) and Claim O we get that 



Val{Kj, Ki) = Val{Kj) - \ A{Ki) D A{Kj)\ 

= Val{F*, Kj) - \A{Ki) n A{Kj] 
= Diff(i^*,X„X,) , 



(40) 
(41) 
(42) 



where Ineq. (40) follows from Eq. (34), Ineq. (41) follows from induction assumption 



and Ineq. (42) follows from Claim 4.1, Thus since the triple = (/^^, /^i, consists of 



compatible configurations, by Eq. (33), Eq. (34), Eq. (35) and Eq. (40) we get that 

Val{K,) < Val{^) = Difl[(F;,Xi,X,) + Difl[(F2*,X2,X,) + \A{k,)\ . (43) 



Assume for contradiction that Val{i<ii) > ya/*(/^i). Then, by Eq. (43) and Eq. (39), we 
get that there exists j G {1,2} such that 



Difr(F;,x„x,) > Difr(F*,x„x,). 



(44) 



Without loss of generality let j = 1. Combining Claim 4.1 and Ineq. (44), we get 



that Val{F* , Xi) < Val{F^,Xi), and since F* G ^(/^i), we end with contradiction 
to the induction assumption. We therefore get that l/a/(F*,Xi) = Val{F^ ^ Xi) and 
ya/(F*, X2) = Val{F;,X2). The lemma follows. | 



We now complete the proof of Thm. 4.1 
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Proof: Let T = {k.^ ^ tb{r) \ nr{u) G {0, 1,cl;}, V^i G X^} be set of configurations in the 
root r of T, in which aU vertices are in a single connected component. Note that there 
exists a configuration Kr such that the optimal secluded tree T* G J-'{tvr)- Let /^* G T 
be the configuration that attains the minimum value ]/a/(/^*) in the set. It then follows, 
by Lemma 4.2, l/a/(/^*) = l/a/*(/^*) = The Steiner tree can be computed by a 

brute- force backtracking procedure. 

We now turn to running time. The size of any table tb{i) is bounded by 0{(ju\ • u)). 
During the bottom-up traversal of T we consider all possible row combinations for at 
most 3 tables. The merge operation can be done in 0{(jj) time, and hence requires overall 
0{B^^i • uj). All other operations for entry processing are linear in the size of the bag, 
0{uj). Note that computing the values of subsolution tvi are linear in the treewidth. Thus 
our time complexity is of the same as order as that of [6j. In particular, it is linear for 
graphs with fixed treewidth, in this case the tree decomposition can also be computed in 
linear time and hence the total complexity is linear. If TW{G) = 0(logn/ loglogn) then 
given the tree decomposition T for G, the running time is O(n^). | 

Note that the algorithm extends also to directed and node- weighted graphs. We 
now outline the modifications needed for these extensions. The extension to weighted 
graph is immediate. The only modification needed is to define Difl[(74, S)^ Diff(5, S) and 
Cost(T, S') according to the weighted cost as in Eq. ([T]) instead of the unweighted cost, 
and apply a similar analysis. 

The extension to directed graphs is a bit more involved. Note that in the directed case 
it is not enough to store configurations that represent the different connected components 
since in order to construct a valid tree we need to make sure that every node has only 
one parent. Therefore it is important to "keep track" of the roots of these connected 
components. For each connected component C we store as part of the configuration the 
root r{C) of this connected component. Namely, we add to each configuration a root for 
each color. In order to merge two connected components Ci and C2 to single tree with 
root we only consider edges from r{C2) to Ci. However, note that the root of a 

component does not necessarily have to be from the current bag and hence the number of 
different configurations might be too large. To overcome this issue we use the following 
observation. Consider a bag Xi and a configuration /^^ G tb{i). Note that it cannot be that 
two different connected components in /^^ would have both roots that are not from Xi. To 
see this, consider a subtree whose root r is not in X^, since X^ is a separator then r does 
not have edges to nodes in 1/ \ X^ forcing r to be the root of the final tree. Therefore, 
there could be at most one such component. We thus restrict ourselves to configurations 
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where there is at most one connected component with a root not in Xi (otherwise the 
configuration is invahd). Notice that the number of vahd configurations might increase 
by at most a factor of n (for aU options of the root not in Xi). Hence, the running time 
increases by a factor of n. The rest of the analysis is similar and thus is omitted from this 
version. 

Bounded Density Graphs. Let DegCost*(5) = min{DegCost(r) | T G T{S)}. We 
show the following. 

Proposition 4.3 Let Q be a hereditary class of graphs with a linear number of edges^ i.e., 
a set of graphs such that for each G ^ Q , \E{G)\ < l-\V{G) \ for some constantly and where 
G eg implies that G' e Q for each subgraph G' ofG. Then DegCost*(5) < 21 • g*(5). 

Proof: Note that for every Steiner tree T in G, it holds that 

Cost(T) < DegCost(T) < 21 • Cost(T). (45) 

The left inequality is immediate. We now prove the right inequality. Let N = N\T] be 
the neighborhood of T and let G{N) be the subgraph of G induced by and E{N) be 
its edge set. We have that 

DegCost(r) = 2\E{N)\ < 2i\N\ = 2£ • Cost(T) , 

where the first equality follows by definition, the second inequality follows from the hered- 
itariness of GG and the third equality follows by the definition of Cost(T). Hence, Eq. 



45] holds. In particular, for the optimal secluded tree T*, we get that 

Cost(T*) = g*(5) < 2£ • DegCost(r*) < 2£ • DegCost*(5). 

The proposition holds. | 

An example of such a class of graphs is the family of planar graphs. For this family, 
the above proposition yields a 6-approximation for the Secluded Path problem. (A more 
careful direct analysis for planar graphs yields a 3-approximation, however) Finally, note 
that whereas computing DegCost*(5) for a constant number of terminals k is polynomial, 
for arbitrary /c, computing DegCost*(5) is NP-hard but can be approximated to within 
a ratio of B(log/c) thanks to Claim [ZSj Thus, for the class of bounded density graphs, 
by Prop. |4.3[ the Secluded Path problem has a constant ratio approximation and the 
Secluded Steiner Tree problem has an B(log/c) ratio approximation. 

Theorem 4.4 For the class of bounded- density graphs^ the Secluded Path (respectively ^ 
Secluded Steiner Tree ) problem is approximated within a ratio of 0(1) (resp., ©(log k) ). 
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